Dynamic Control Algorithm for Biped Walking Based on Policy Gradient Fuzzy Reinforcement Learning
نویسندگان
چکیده
This paper presents a novel dynamic control approach to acquire biped walking of humanoid robots focussed on policy gradient reinforcement learning with fuzzy evaluative feedback . The proposed structure of controller involves two feedback loops: conventional computed torque controller including impact-force controller and reinforcement learning computed torque controller. Reinforcement learning part includes fuzzy information about Zero-Moment Point errors. To demonstrate the effectiveness of our method, we apply it in simulation to the learning of a biped walking.
منابع مشابه
Episodic Reinforcement Learning Control Approach for Biped Walking
This paper presents a hybrid dynamic control approach to the realisation of humanoid biped robotic walk, focusing on the policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed structure of controller involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part inclu...
متن کاملFast biped walking with a reflexive controller and real-time policy searching
In this paper, we present our design and experiments of a planar biped robot (“RunBot”) under pure reflexive neuronal control. The goal of this study is to combine neuronal mechanisms with biomechanics to obtain very fast speed and the on-line learning of circuit parameters. Our controller is built with biologically inspired sensorand motor-neuron models, including local reflexes and not employ...
متن کاملFast Biped Walking with a Sensor-driven Neuronal Controller and Real-time Online Learning
In this paper, we present our design and experiments on a planar biped robot under the control of a pure sensor-driven controller. This design has some special mechanical features, for example small curved feet allowing rolling action and a properly positioned center of mass, that facilitate fast walking through exploitation of the robot’s natural dynamics. Our sensor-driven controller is built...
متن کاملPareto design of fuzzy tracking control based on the particle swarm optimization algorithm for a walking robot in the lateral plane on slope
Many researchers have controlled and analyzed biped robots that walk in the sagittal plane. Nevertheless, walking robots require the capability to walk merely laterally, when they are faced with the obstacles such as a wall. In walking robot field, both nonlinearity of the dynamic equations and also having a tracking system cause an effective control has to be utilized to address these problems...
متن کاملPoincaré-Map-Based Reinforcement Learning For Biped Walking
We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately modulate an observed walking pattern. Viapoints are detected from the observed walking trajectories using the minimum jerk criterion. The learning algorithm modulates the via-points as control actions to improve walking trajectories. This decision is based on a learned model of...
متن کامل